Time-compressing natural and synthetic speech

نویسنده

  • Esther Janse
چکیده

Phoneme detection is a useful tool to compare the perception of perfectly intelligible speech types. As previous research suggests that perception of fast speech is helped by segmental redundancy, we expected the hyperarticulation of synthetic speech to turn into an advantage at a fast rate. Consequently, the processing advantage of natural over synthetic speech was expected to decrease after time-compression. Secondly, detection times were expected to be slower after moderate timecompression because of the higher processing difficulty of fast speech. However, detection times tended to become shorter in the time-compressed condition. This was attributed to shorter durations of syllables and words. Furthermore, the processing advantage of natural over synthetic speech did not decrease, but rather tended to increase. This may be explained by the lack of a speaking effort pattern in synthetic diphone speech, which makes it rather blurred at faster playback rates.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Recognition of time-compressed speech does not predict recognition of natural fast-rate speech by older listeners.

This study investigated whether recognition of time-compressed speech predicts recognition of natural fast-rate speech, and whether this relationship is influenced by listener age. High and low context sentences were presented to younger and older normal-hearing adults at a normal speech rate, naturally fast speech rate, and fast rate implemented by time compressing the normal-rate sentences. R...

متن کامل

The Temporal Delay Hypothesis: Natural, Vocoded and Synthetic Speech

Including disfluencies in synthetic speech is being explored as a way of making synthetic speech sound more natural and conversational. How to measure whether the resulting speech is actually more natural, however, is not straightforward. Conventional approaches to synthetic speech evaluation fall short as a listener is either primed to prefer stimuli with filled pauses or, when they aren’t pri...

متن کامل

The effect of filled pauses and speaking rate on speech comprehension in natural, vocoded and synthetic speech

It has been shown that in natural speech filled pauses can be beneficial to a listener. In this paper, we attempt to discover whether listeners react in a similar way to filled pauses in synthetic and vocoded speech compared to natural speech. We present two experiments focusing on reaction time to a target word. In the first, we replicate earlier work in natural speech, namely that listeners r...

متن کامل

Voice Onset Time and the Perception of Japanese Voicing Contrasts

Much crosslinguistic research exists on the production and perception of voice onset time (VOT). However, most research on the perception of VOT uses synthetic stimuli instead of natural speech stimuli. Effects of synthetic speech on the perception of VOT are not known, but more research needs to be done to see if there are differences between perception using synthetic speech and perception us...

متن کامل

Perception of Speech Rate and Naturalness in Synthetic Slow Speech

This paper details two perception experiments based on synthetic British English obtained with CART models predicting phone durations in slow speech from normal speed speech. Speech rate and naturalness were assessed by 6 English natives. Synthetic slow speech was rated as both slower and less natural than natural slow speech; however, the insertion of the pauses produced in natural slow speech...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2002